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Abstract 

We consider the problem of localizing wireless devices in an ad-hoc network embedded in a d- 
dimensional Euclidean space. Obtaining a good estimation of where wireless devices are located is crucial 
in wireless network applications including environment monitoring, geographic routing and topology 
control. When the positions of the devices are unknown and only local distance information is given, 
we need to infer the positions from these local distance measurements. This problem is particularly 
challenging when we only have access to measurements that have limited accuracy and are incomplete. 
We consider the extreme case of this limitation on the available information, namely only the connectivity 
information is available, i.e., we only know whether a pair of nodes is within a fixed detection range 
of each other or not, and no information is known about how far apart they are. Further, to account for 
detection failures, we assume that even if a pair of devices is within the detection range, it fails to detect 
the presence of one another with some probability and this probability of failure depends on how far 
apart those devices are. Given this limited information, we investigate the performance of a centralized 
positioning algorithm MDS-MAP introduced by Shang et al. |SRZF03|, and a distributed positioning 
algorithm HOP-TERRAIN introduced by Savarese et al. [SLR02|. In particular, for a network consisting 
of n devices positioned randomly, we provide a bound on the resulting error for both algorithms. We 
show that the error is bounded, decreasing at a rate that is proportional to -Rcriticai/^?, where -Roiticai 
is the critical detection range when the resulting random network starts to be connected, and R is the 
detection range of each device. 
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I. Introduction 

In this paper, we address the problem of positoining (also referred to as sensor localization) when 
only a set of incomplete pairwise distances is provided. Location estimation of individual nodes is a 
requirement of many wireless sensor networks such as environment monitoring, geographic routing and 
topology control, to name only a few (for a thorough list of applications we refer the interested readers 
to IIJH0111 . HXu0 20. In environment monitoring for instance, the measurement data by the wireless sensor 
network is essentially meaningless without knowing from where the data is collected. 

One way to acquire the positions is to equip all the sensors with a global positioning system (GPS). 
The use of GPS not only adds considerable cost to the system, but more importantly, it does not work in 
indoor environments or when the received GPS signal is jammed (see HCHH 02 1 and the references therein 
for more information on this issue). As an alternative, we seek an algorithm that can derive positions 
of sensors based on local/basic information such as proximity (which nodes are within communication 
range of each other) or local distances (pairwise distances between neighbouring sensors). 

Two common techniques for obtaining the local distance and connectivity information are Received 
Signal Strength Indicator (RSSI) and Time Difference of Arrival (TDoA). RSSI is a measurement of the 
ratio of the power present in a received radio signal and a reference power. Signal power at the receiving 
end is inversely proportional to the square of the distance between the receiver and the transmitter. Hence, 
RSSI has the potential to be used to estimate the distance and it is common to assume the use of RSSI 
in distance measurements. However, experimental results indicate that the accuracy of RSSI is limited 
[PCBOO]. TDoA technique uses the time difference between the receipt of two different signals with 
different velocities, for instance ultrasound and radio frequency signals [SHS01]. The time difference is 
proportional to the distance between the receiver and the transmitter; and given the velocity of the signals, 
the distance can be estimated from the time difference. These techniques can be used, independently or 
together, for distance estimation. In an alternative approach, Angle of Arrival (AoA) can also be used to 
infer the positions of sensors [NN01]. Once a node has the angle of arrival information from three other 
nodes with known positions, we can perform triangulation to locate the wireless node. To measure the 
angle of arrival, an antenna array is required at each wireless node. 

Given a set of measurements, the problem of localization is solvable, meaning that it has a unique set 
of coordinates satisfying the given local information, only if there are enough constraints. The simplest 
of such algorithms, i.e., multi dimensional scaling (MDS) [BG05], assumes that all pairwise distances 
are known. Intuitively, it is clear that with 0(n 2 ) pairwise distances we should be able to determine 
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0{n) coordinates. However, in almost all practical scenarios such information is unavailable for two 
major reasons. First, sensors are typically highly resource-constrained (e.g., power) and have limited 
communication range. Thus, far away sensors cannot communicate and obtain their pairwise distances. 
Second, due to noise and interference among sensors, there is always the possibility of non-detection or 
completely incoherent measurements. 

Many algorithms have been proposed to resolve these issues by using heuristic approximations to the 
missing distances, and their success has mostly been measured experimentally. Regarding the mechanisms 
deployed for estimating sensor locations, one can divide the localization algorithms into two categories: 
range-based and range-free. In the range-based protocols the absolute point-to-point distance estimates are 
used for inferring the locations, whereas in the range-free protocols no assumptions about the availability 
of such information are made and only the connectivity information is provided. As a result, range-free 
algorithms are more effective in terms of stability and cost, hence more favourable to be deployed in 
practical settings. 

The theoretical guarantees associated with the performance of the existing methods are, however, of the 
same interest and complementary in nature. Such analytical bounds on the performance of localization 
algorithms can provide answers to practical questions: for example," How large should the radio range 
be in order to get the reconstruction error within a threshold?" With this motivation in mind, our work 
takes a step forward in this direction. 

We first focus on providing a bound on the performance of a popular localization algorithm MDS- 
MAP [SRZF03] when applied to sensor localization from only connectivity information. We should stress 
here that pairwise distances are invariant under rigid transformations (rotation, translation and reflection). 
Hence, given connectivity information, we can only hope to determine the configuration or the relative 
map of the sensors. In other words, localization is possible only up to rigid transformations. With this 
point in mind, we prove that using MDS-MAP, we are able to localize sensors up to a bounded error in a 
connected network where most of pairwise distances are missing and only local connectivity information 
is given. 

More precisely, assume that the network consists of n sensors positioned randomly in a <i-dimensional 
unit cube with the radio range R = o(l) and detection probability p. Let the n x d matrices X and X 
denote the true sensor positions and their estimates by MDS-MAP, respectively. Define L = I nxn — 
(l/n)l„l£ where I n xn is the identity matrix and 1„ is the all ones vector. It is not difficult to show 
that LXX T L satisfies nice properties, specifically, it is invariant under rigid transformations and if 
LXX T L = LXX T L, then X and X are equal up to rigid transformations. Therefore, we can naturally 
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define the distance between Xand X as follows: 



d inv (X,X) = - \\LXX T L - LXX T L 



n 



where || • \\p denote the Frobenius norm. Our first result establishes a bound on the error of MDS-MAP 
in terms of di nv , specifically, 



One consequence of the ad-hoc nature of the underlying networks is the lack of a central infrastructure. 
This fact prevents the use of common centralized positioning algorithms such as MDS-MAP. In particular, 
centralized algorithms suffer from the scalability problem, and generally it is not feasible for them to be 
implemented in large scale sensor networks. Other disadvantages of centralized algorithms, as compared 
to distributed algorithms, are their requirements for higher computational complexity and lower reliability; 
these drawbacks are due to accumulated information inaccuracies caused by multi-hop transmission over 
a wireless network [MFA07]. 

We then investigate an important question about whether similar performance guarantees can be 
obtained in a distributed setting where each sensor tries to estimate its own global position. As men- 
tioned above, this task cannot be accomplished unless some additional information, rather than local 
measurements, is provided. It is well known that in a (i-dimensional Euclidean space, we need to know 
the global positions of at least d + 1 sensors, referred to as anchors, in order to uniquely determine the 
global positions of the remaining sensors [NN01]. 

For the decentralized scenario, we turn our attention to analysing the performance of a popular 
localization algorithm called HOP-TERRAIN algorithm [SLR02]. This algorithm can be seen as a 
distributed version of the MDS-MAP. Similar to MDS-MAP, we prove that by using HOP-TERRAIN, 
we are able to localize sensors up to a bounded error in a connected network where most of the pairwise 
distances are unknown and only local connectivity information is given. 

More formally, assume that the network consists of n sensors positioned randomly in a (i-dimensional 
unit cube and d+1 anchors in general positions. Moreover, we let the radio range R = o(l) and denote 
the detection probability by p. We show that when only connectivity information is available, for every 
unknown node i, the Euclidean distance between the estimate X{ and the correct position Xi is bounded 



where R MDS = C d (ln(n)/n) 




by 




where Rnop = C' d (logn/n)d for some constant C' d that only depends on d. 
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TABLE I 



Distributed localization algorithm classification RLR03II 



Phase 



Robust positioning Ad-hoc positioning iV-hop multilateration 



1. Distance 



DV-HOP 



Euclidean 



Sum-dist 



2. Position 



Lateration 



Lateration 



Min-max 



3. Refinement Yes 



No 



Yes 



II. Related work 



The localization problem and its variants has attracted significant research interest in recent years. A 
general survey of the area and an overview of recent techniques can be found in [NN01] and [MF A07L 
respectively. The problem is also closely related to dimensionality reduction [RSOO] and manifold learning 
[SR03] in which the objects/data come from a high dimensional space, and the goal is to compute a 
low-dimensional, neighbourhood preserving embeddings. 

In the case when all pairwise distances are known, the coordinates can be derived by using a classical 
method known as multidimensional scaling (MDS) BBG0511 . The underlying principle of the MDS is 
to convert distances into an inner product matrix, whose eigenvectors are the unknown coordinates. In 
the presence of noise, MDS tolerates errors gracefully due to the overdetermined nature of the solution. 
However, when most pairwise distances are missing, the problem of finding the unknown coordinates 
becomes more challenging. For centralized algorithms (where all the measurements are sent to a single 
processor and the estimated positions are computed) three types of practical solutions to the above problem 
have been proposed in the literature. The first group consists of algorithms that try first to estimate the 
missing entries of the distance matrix and then apply MDS to the reconstructed distance matrix to find the 
coordinates of the sensors. MDS -MAP, introduced in HSRZF031 and further studied in HSRZF04L can be 
mentioned as a well-known example of this class where it computes the shortest paths between all pairs 
of nodes in order to approximate the missing entries of the distance matrix. The algorithms in the second 
group mainly consider the sensor localization as a non-convex optimization problem and directly estimate 
the coordinates of sensors. A famous example of this type is a relaxation to semidefinite programming 
(SDP)[BY04|. In the third group, the problem is formulated through a stochastic optimization where the 
main technique used in these algorithms is the stimulated annealing, which is a generalization of the 
Monte Carlo method in combinatorial optimization [KM V06I . EM06]. 
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Perhaps a more practical and interesting case is when there is no central infrastructure. [LR03] identifies 
a common three-phase structure of three, popular, distributed sensor-localization algorithms, namely robust 
positioning MSLR02L ad-hoc positioning HNN03I and N-hop multilateration HSPS031 . Table |l] illustrates 
the structure of these algorithms. In the first phase, nodes share information to collectively determine 
the distances from each of the nodes to a number of anchors. Anchors are special nodes with a priori 
knowledge of their own position in some global coordinate system. In the second phase, nodes determine 
their position based on the estimated distances to the anchors provided by the first phase and the known 
positions of the anchors. In the last phase, the initial estimated positions are iteratively refined. It is 
empirically demonstrated that these simple three-phase distributed sensor-localization algorithms are 
robust and energy-efficient [LR03]. However, depending on which method is used in each phase, there 
are different tradeoffs between localization accuracy, computation complexity and power requirements. 
In HNSB03L a distributed algorithm-called the Gradient algorithm- was proposed; it is similar to ad-hoc 
positioning [NN03] but uses a different method for estimating the average distance per hop. 

Another distributed approach introduced in |IFMW04] is to pose the localization problem as an 
inference problem on a graphical model and solve it by using Nonparametric Belief Propagation (NBP). It 
is naturally a distributed procedure and produces both an estimate of sensor locations and a representation 
of the location uncertainties. The estimated uncertainty may subsequently be used to determine the 
reliability of each sensor's location estimate. 

The performances of these practical algorithms are invariably measured through simulations and little is 
known about the theoretical analysis supporting their results. A few exceptions are in the following work. 
In [DJMI+06] the authors use matrix completion methods |Faz02] as a means to reconstruct the distance 
matrix. The main contribution of their paper is that they are able to provably localize the sensors up to 
a bounded error. However, their analysis is based on a number of strong assumptions. First, they assume 
that even far-away sensors have a non-zero probability of detecting their distances. Second, the algorithm 
explicitly requires the knowledge of detection probabilities between all pairs. Third, their theorem only 
works when the average degree of the network (i.e., the average number of nodes detected by each sensor) 
grows linearly with the number of sensors in the network. 

Our first result, specifically the analysis of MDS-MAP, has a similar flavour as in HDJMI+061 . We 
provide a theoretical guarantee that backs up experimental results. We use shortest paths as our primary 
guess for the missing entries in the distance matrix and apply MDS to find the topology of the network. 



In contrast to ||DJMI + 06ll , we require weaker assumptions for our results. More specifically, we assume 
that only neighbouring sensors have information about each other and that only connectivity information 



October 14, 2011 



DRAFT 



7 



is known. Furthermore, the knowledge of detection probabilities plays no role in our analysis or the 
algorithm. And last, in our analysis we assume that the average degree grows logarithmically-not linearly- 
with the number of sensors, which results in needing many less revealed entries in the distance matrix. 
In particular, the last condition is quite realistic: If the average degree grows any slower then the network 



is not even connected (more on this issue in Section IV-C). As the shortest paths algorithm works for 
both rage-free and range-aware cases, our analysis includes both and provides the first error bounds on 
the performance of MDS-MAP. 

Of particular interest are the two new results on the performance of sensor localization algorithms. In 
[JM11], Javanmard et al. proposes a new reconstruction algorithm based on semidefmite programming 
where they could establish lower and upper bounds on the reconstruction errors of their algorithm. 
Similarly, in [KOPV10], due to new advances in matrix completion methods [CR08], the authors analyse 
the performance of OptSpace [KM10], a novel matrix completion algorithm, in localizing the sensors. 
Interestingly, they did not need to adhere to the assumptions made by [DJM I + 06ll . However, they have 
a restrictive assumption about the topology of the network, specifically, sensors are scattered inside an 
annulus. 

All the above analytical results crucially rely on the fact that there is a central processor with access to 
the inter-sensor distance measurements. However, as we have mentioned earlier, centralized algorithms 
suffer from the scalability problem and require higher computational complexity. Hence, a distributed 
algorithm with similar a performance bound is desirable. In our second result, we analyse the recon- 
struction error of a distributed sensor localization algorithm. To the best of our knowledge we show for 
the first time that HOP-TERRAIN, introduced in [SLR02], achieves a bounded error when only local 
connectivity information is given. 

Finally, one of the fundamental challenges in localization problem is whether, given a set of mea- 
surements, the sensor network is uniquely localizable or not. In the noiseless setting where all the 
measurements are accurate, it was shown that the correct notion through which we can answer this 
question is the global rigidity [MWY06], a property that is easy to check (a thorough discussion of 
global rigidity and its implications for the sensor localization problem is given in [GH DTIOID . However, 
finding such a unique solution is NP-hard [ JJ05 ] . In the case of noisy distance measurements very little is 
known in this area. For instance, we do not know the fundamental limits for sensor localization algorithms 
or whether there are any algorithms with proven guarantees. From this point of view, our results narrow 
the gap between the algorithmic aspect of sensor localization and the theoretical one. In particular, we 
show that even in the presence of noise, the MDS-MAP and HOP-TERRAIN algorithms can localize 
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the nodes within a bounded error. 



The organization of this paper is as follows. In Section III we introduce the model and the notation 



used in our work. In Section IV we describe the MDS-MAP and HOP-TERRAIN algorithms and 



their common features. Our results are stated in Section [V] where we provide their proofs in Section VI 



Finally, we conclude in Section VII 



III. Model definition 

Before discussing the centralized and distributed localization algorithms in detail, we define the math- 
ematical model considered in this work. First, we assume that we have no fine control over the placement 
of the sensors that we call the unknown nodes (e.g., the nodes are dropped from an airplane). Formally, 
we assume that n nodes are placed uniformly at random in a d-dimensional cube [0, l] d . 

Additionally, we assume that there are m special sensors, which we call anchors, with a priori 
knowledge of their own positions in some global coordinate. In practice, it is reasonable to assume 
that we have some control over the position of anchors. Basically, anchors are the nodes that are planted 
on the field before any positioning takes place. 

Let V a = {1, . . . ,m} denote the set of m vertices corresponding to the anchors and V u = {m + 
1, . . . ,m+n} the set of n vertices corresponding to the unknown nodes. We use Xi to denote the random 
position of the node i and X to denote the n x d position matrix where the i-th row corresponds to Xj. 

In positioning applications, due to attenuation and power constraints, only measurements between 
close-by nodes are available. As a result, the pairwise distance measurements can be represented by a 
random geometric graph G(n + m, R) = (V, E, P), where V = V U U V a , E C V x V is a set of undirected 
edges that connect pairs of sensors that are close to each other, and P : E — » IR + is a non-negative 
real-valued function. The function P is a mapping from a pair of connected nodes (i, j) € E to a distance 
measurement between i and j. 

A common model for this random geometric graph is the disc model where node i and j are connected 
if the Euclidean distance dij = \\xi — Xj\\ is less than or equal to a positive radio range R. In formulae, 

As mentioned earlier, there are a variety of ways to measure the connectivity between two nodes, 
including time difference of arrival and RF received-signal strength (also called RF ranging). Due to 
limited resources, in all of the mentioned solutions there is a probability of non-detection (or completely 
wrong estimation). Think of RF ranging in the presence of an obstacle or in the (frequent) case of multiple 
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Fig. 1. This plot shows how the probability of detection changes as the distance between two sensor changes. 



paths. Depending on the acquisition mechanism, this may result in the absence of measurement or in 
incoherent measurements. 

Throughout this paper, to model this failure of detection, we assume that two nodes can detect each 
other with a probability that only depends on the distance d^j. Namely, (i, j) e E with probability p(dij) 
if dij < R. The detection probability p(-) : [0,R] — > [0, 1] is a non-increasing function of the distance. 
We consider a simple function parameterized by two scalar values a G (0, 1] and f3 E [0,3): 

p(z) = min (l,a ) , (1) 



.Ry 

for a G (0, 1] and f3 G [0,d). Note that this includes the disc model with perfect detection as a special 
case (i.e., a = l,/3 = 0). 

To each edge (i, j) G E, we associate the distance measurement Pij between sensors i and j. In 
an ideal case, we have exact distance measurements available for those pairs in E. This is called the 
range-based model or the range-aware model. In formulae, 

di,j if (i,j)£E, 
* otherwise, 
where a * denotes that the distance measurement is unavailable. 

In this paper, we assume that we are given only network connectivity information and no distance 



P 
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Connected 



Disconnected 




Fig. 2. This example shows the model we consider in this work. Nodes a and b are connected since they are within radio 
range R from each other. Even though the similar situation presents for b and c, they are not connected due to detection failure. 
Finally, nodes a and c are not connected because they are far apart. 



information. This is known as the connectivity-based model or the range-free model. More formally, 

p / 1 if (i,j)€E, 
! * otherwise. 

In the following, let D denote the n x n squared distance matrix where Dij = df j. By definition, 

D = atl + t n a T - 2XX T , 

where a G IR n is a vector with a, = ||a^|| 2 and l n is the all ones vector. As D is a sum of two rank-1 
matrices and a rank-d matrix, its rank is at most d + 2. 

IV. Algorithms 

In general, there are two solutions to the localization problem: a relative map and an absolute map. 
A relative map is a configuration of sensors that have the same neighbor relationships as the underlying 
graph G. In the following we use the terms configuration, embedding, and relative map interchangeably. 
An absolute map, on the other hand, determines the absolute geographic coordinates of all sensors. In 
this paper our objective is two-fold. First, we present the centralized algorithm MDS-MAP, that finds a 
configuration that best fits the proximity measurements. Then, we discuss its distributed version HOP- 
TERRAIN where its goal is for each sensor to find its absolute position. For both, we provide analytical 
bounds on the error between the estimated configuration and the correct configuration. 



October 14, 2011 



DRAFT 



11 



TABLE II 

Summary of Notation. 



II 


number of unknown sensors 


V u 


set of unknown nodes 


m 


number of anchors 


V a 


set of anchors 


R 


communication range 


In 


all ones vector of size n 


P 

* »*? 


distance measurements 


D 


estimated squared distance matrix 


dij 


Pairwise distance between nodes i and j 


In X n 


n x n identity matrix 




position of node % 


Xi 


estimated position of node i 


P 


detection probability 


X 


positions matrix 


d 


dimension 


X 


estimated positions matrix 


D 


squared distance matrix 


di,j 


shortest path between node i and j 


0(d) 


orthogonal group of d x d matrices 


\\-\\f 


Frobenius norm 


(A,B) 


Frobenius inner product 


\\-h 


spectral norm 



A. Centralized Positioning Algorithm: MDS-MAP 

For the centralized positioning algorithm, we assume that there is no anchor node in the system, namely, 
V a = (p. We define a set of random positions of n sensors X = {x\, . . . , x n }. MDS-MAP consists of 
two steps: 

Algorithm : MDS-MAP HSRZF03H 
Input: dimension d, graph G = (V, E, P) 
1: Compute the shortest paths, and let D be 

the squared shortest paths matrix; 
2: Apply MDS to D, and let X be the output. 

Shortest paths. The shortest path between nodes i and j in graph G = (V, E, P) is defined as a path 
between two nodes such that the sum of the proximity measures of its constituent edges is minimized. 
Let dij be the computed shortest path between node i and j. Then, the squared shortest paths matrix 
D £ W lXn is defined as = dfj for i / j, and for i = j. 

Multidimensional scaling. In step 2, we apply the MDS to D to get a good estimate of X, specifically, 
we compute X = MDSd(D). Multidimensional scaling (MDS) refers to a set of statistical techniques 
used in finding the configuration of objects in a low dimensional space such that the measured pairwise 
distances are preserved [BG05]. It is often used for a visual representation of the proximities between 
a set of items. For example, given a matrix of perceived similarities or dissimilarities between n items, 
MDS geometrically places each of those items in a low dimensional space such that the items that are 
similar are placed close to each other. Formally, MDS finds a lower dimensional embedding fjS that 
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Fig. 3. The shortest path between two nodes is defined in terms of the minimum number of hops multiplied by the radio range 
R. For instance, the shortest path between i and a\ is 2R. 



minimize the stress denned as 



where dij is the input similarity (or dissimilarity), dij = \\£i —Xj\\ is the Euclidean distance in the lower 
dimensional embedding, and /(•) is some function on the input data. When MDS perfectly embeds the 



In this chapter we use what is called the classic metric MDS (we refer the interested reader to HCC01L 
for the definition of other types of MDS algorithms, for instance non-metric MDS, replicated MDS, 
and weighted MDS). In classic metric MDS, /(•) is the identity function and the input dissimilarities 
correspond to the Euclidean distances such that dij = \\xi — Xj\\ for some lower dimensional embedding 
{xi}. Further, when all the dissimilarities (or pairwise distances) are measured without error, the following 
spectral method correctly recovers the lower dimensional embedding up to a rigid motion. 
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input data, we will have f(dij) = dij and the stress is zero. 
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Algorithm : Classic Metric MDS HSRZF03I 
Input: dimension d, estimated distance matrix M 
1: Compute (-l/2)LML, 

where L = I n — (l/n)l n l£; 
2: Compute the best rank-(f approximation Ud^dUj 

of (-1/2)LML; 
3: Return MDS d (M) = CT^y 2 . 

This algorithm has been frequently used in positioning applications; and in the future, whenever we say 
MDS we refer to the above algorithm. Let L be an n x n symmetric matrix such that 

L = I n - (l/n)l„l£, 

where l n G IR n is the all ones vector and I n is the n x n identity matrix. Let MDS^(D) denote the 
n x d matrix returned by MDS when applied to the squared distance matrix D. Then, in formula, given 
the singular value decomposition (SVD) of a symmetric and positive definite matrix {—1/2)LDL as 
(-1/2)LDL = UY>U T , 

MDSrf(D) = U d T}J 2 , 

where Ud denotes the n x d left singular matrix that corresponds to the d largest singular values and Y, d 
denotes the d x d diagonal matrix with d largest singular values in the diagonal. This is also known as 
the MDSLOCALIZE algorithm in ||DJMI + 06l . Note that as the columns of U are orthogonal to t n by 
construction, it follow that L ■ MDS d (D) = MDS d (D). 

It can be easily shown that when MDS is applied to the correct squared distance matrix without noise, 
the configuration of sensors are exactly recovered jDJMI + 06l . This follows from the following equality 

— (1/2)LDL = LXX T L . (2) 

Note that we only obtain the configuration and not the absolute positions, in the sense that MDSd(D) is 
one version of infinitely many solutions that matches the distance measurements D. Therefore there are 
multiple incidents of X that result in the same D. We introduce a formal definition of rigid transformation 
and related terms. 

We denote by 0(d) the orthogonal group of d x d matrices. A set of sensor positions Y € W ixd 
is a rigid transformation of X, if there exists a ci-dimensional shift vector s and an orthogonal matrix 
Q 6 0(d) such that Y = XQ + t n s T . Here Y should be interpreted as a result of first rotating (and/or 
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reflecting) sensors in position X by Q and then adding a shift by s. Similarly, when we say two position 
matrices X and Y are equal up to a rigid transformation, we mean that there exists a rotation Q and a 
shift s such that Y = XQ + l n s T . Also, we say a function f(X) is invariant under rigid transformation 
if and only if for all X and Y that are equal up to a rigid transformation we have f(X) = f(Y). Under 
these definitions, it is clear that D is invariant under rigid transformation, as for all 

Dtj = \\xi - Xjf = \\(xiQ + s T ) - feQ + s T )|| 2 , 

for any Q G 0(d) and s G R d . 

Although MDS works perfectly when D is available, in practice not all proximity measurements are 
available because of the limited radio range R. This is why, in the first step, we estimated the unavailable 
entries of D by finding the shortest path between disconnected nodes. 

B. Distributed Positioning Algorithm: HOP-TERRAIN 

Recall that HOP-TERRAIN is a distributed algorithm that aims at finding the global map. Notice 
that in order to fix the global coordinate system in a d dimensional space, we need to know the positions 
of at least d+1 nodes. As we defined before, these nodes whose global positions are known are called 
anchors. In this section we assume that we have m anchors in total, i.e., V a = {1,2, ... ,m}. Based 
on the robust positioning algorithm introduced in |SLR02], the distributed sensor localization algorithm 
consists of two steps : 

Algorithm : HOP-TERRAIN|SLR02J 

1: Each node % computes the shortest paths 

{di : a '■ a G Va] between itself and the anchors; 
2: Each node i derives an estimated position Xi 

by triangulation with a least squares method. 

Distributed shortest paths: Similarly to MDS -MAP, the first step is about finding the shortest path. 
The difference is that in the first step each of the unknown nodes only estimates the distances between 
itself and the anchors. These approximate distances will be used in the next triangulation step to derive 
an estimated position. In other words, the shortest path between an unknown node % and an anchor a in 
the graph G provides an estimate for the Euclidean distance = \\x{ — x a \\. 

We denote by d^ a the computed shortest path and this provides the initial estimate for the distance 
between the node i and the anchor a. When only the connectivity information is available and the 
corresponding graph G = (V, E, P) is defined as in the connectivity -based model, the shortest path d%^ a 
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is equivalent to the minimum number of hops between a node i and an anchor a multiplied by the radio 
range R. 

In order to find the minimum number of hops from an unknown node i G V u to an anchors a G V a in 
a distributed way, we use a method similar to DV-HOP |NN03]. Each unknown node maintains a table 
{x a , ha] that is initially empty, where x a G M. d refers to the position of the anchor a and h a to the number 
of hops from the unknown node to the anchor a. First, each of the anchors initiate a broadcast containing 
its known location and a hop count of one. All of the one-hop neighbors surrounding the anchor, on 
receiving this broadcast, record the anchor's position and a hop count of one, and then broadcast the 
anchor's known position and a hop count of two. From then on, whenever a node receives a broadcast, it 
does one of the two things. If the broadcast refers to an anchor that is already in the record and the hop 
count is larger than or equal to what is recorded, then the node does nothing. Otherwise, if the broadcast 
refers to an anchor that is new or has a hop count that is smaller, the node updates its table with this 
new information on its memory and broadcasts the new information after incrementing the hop count by 
one. 

To estimate the distances between the node and the anchors, when every node has computed the hop 
count to all the anchors, the number of hops is multiplied by the radio range R to estimate the distances 
between the node and the anchors. Note that to begin triangulation, not all the hop counts to all the 
anchors are necessary. A node can start triangulation as soon as it has estimated distances to d + 1 
anchors. There is an obvious trade-off between the number of communications and their performance. 

The above step of computing the minimum number of hops is the same distributed algorithm as 
described in DV-HOP. However, one difference is that instead of multiplying the number of hops by 
a fixed radio range R, in DV-HOP, the number of hops is multiplied by an average hop distance. The 
average hop distance is computed from the known pairwise distances between anchors and the number of 
hops between the anchors, although numerical simulations show that the average hop distance provides a 
better estimate, the difference between the computed average hop distance and the radio range R becomes 
negligible as n grows large. 

Triangulation using least squares. In the second step, each unknown node i G V u uses a set of 
estimated distances {di iCl : a G V a } together with the known positions of the anchors, to perform a 
triangulation. The resulting estimated position is denoted by ii. For each node, the triangulation consists 
in solving a single instance of a least squares problem {Ax = b) and this process is known as Lateration 
HSRB01L IILR031 . 
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Fig. 4. Multilateration with exact distance measurements (left) and with approximate distance measurements (right). Three solid 
circles denote the anchors (red) and the white circle denotes the unknown nodes. The intersection of the blue lines corresponds 
to the solution of multilateration. 



For an unknown node i, the position vector x\ and the anchor positions {x a : a £ {1, ... , m}} satisfy 
the following series of equations: 

II _ ||2 _ j2 
||£C1 Xi\\ — 1 , 

|| _ .||2 _ j2 

Geometrically, the above equalities simply say that the point Xi is the intersection point of m circles 
centred at x\, X2, ■ ■ ■ , x m (see Figure |4]). This set of equations can be linearised by subtracting each line 
from the next line. 

\\x2W 2 ~ \\xi\\ 2 + 2(xi - x 2 ) T Xi = dh-dh, 



\Xm— 1\{ ~r~ 2(x m — 1 Xm) Xi — d\ m dim—\ ■ 
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By reordering the terms, we get a series of linear equations for node i in the form Axi = for 

A g R^- 1 )^ and b G R m_1 defined as 

2(si - x 2 ) T 



,4 = 



6 W = 
°o — 



2(x m _ 1 

Iklll 2 - ||sC2|| 2 + < 2 -^,l 



km-lH 2 IkmH 2 + rf 2 m df m _ l 



Note that the matrix ^4 does not depend on the particular unknown node i and all the entries are known 

(i) 

accurately to all the nodes after the distributed shortest paths step. However, the vector b is not available 
at node i, because d^ a \ are not known. Hence we use an estimation £>W, that is defined from b$ by 
replacing di A by d^ a everywhere. Notice that di A > di^ a . As a result, the circles centred at x\, X2, ■ ■ ■ , x m 
have potentially larger radii. Therefore, the intersection between circles is no longer a single point, but 
rather a closed area. Then, finding the optimal estimation Xj of that minimizes the mean squared error 
is solved in a closed form using a standard least squares approach: 



= (A T A)~ 1 A T b^ . 



(3) 



For bounded d = o(l), a single least squares operation has complexity 0(m), and applying it n times 
results in the overall complexity of 0(nm). No communication between the nodes is necessary for this 
step. 



C. Stretch Factor: Euclidean Distance versus Shortest Path 

In general when the graph G is not connected, the localization problem is not well defined. In fact, 
there are multiple configurations resulting in the same observed proximity measures. For instance if graph 
G consists of two disconnected components, they can be placed in possibly infinitely different ways with 
respect to each other without violating any constraints imposed by G. For this reason we restrict our 
attention to the case where G is connected. 

In this work, we are interested in a scalable system of n unknown nodes for a large value of n. As n 
grows, it is reasonable to assume that the average number of connected neighbours for each node should 
stay constant. This happens, in our model, if we chose the radio range R = C/n 1 ^. However, in the unit 
square, assuming sensor positions are drawn uniformly, the random geometric graph is connected, with 
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Fig. 5. The red vertices indicate the anchors. Under the right scaling of the radio range R, the graph stays connected (left 
figure) whereas otherwise there will be nodes without any means of communication to others (right graph). 

high probability, if ttR 2 > (logn + c n )/n for c n — > oo [GK98]. A similar condition can be derived for 
generic d-dimensions as CdR d > (logn+c n )/n, where Cd < it is a constant that depends on d. Moreover, 
in case CdR d < (logn + c n )/n, not only the graph is not connected, there will be isolated nodes with 
hight probability. Since isolated nodes cannot communicate with other sensors, there is no way to find 
their shortest paths to other nodes. Consequently, both MDS-MAP and HOP-TERRAIN algorithms will 
be in trouble (see Figure [5}. Hence, instead of R = C/n l / d , we focus in the regime where the average 
number of connected neighbors is slowly increasing with n. Let i? cr iticai be the critical detection range 
where the resulting graph starts to be connected. Then we are interested in the regime R = CRoiticab 
for some positive constant C > 1 such that the graph stays connected with high probability. 

In our analysis, the key observation and the crux of the argument is to show that the shortest-path 
estimate is guaranteed to be arbitrarily close to the correct distance for large enough radio range R and 
large enough n. Once we proved this, we can then show that the MDS step (equivalently, lateration) finds 
almost correctly the relative (equivalently, global) position of the sensors. We demonstrate how the error 
in estimating the Euclidean distance will be reflected on the position estimation. The precise statements 
are given in Section [V] 

We have already discussed the complexity of MDS and lateration steps. To complete our discussion 
we need to evaluate the complexity of finding the shortest path. In the MDS-MAP algorithm we require 
that all-pairs shortest paths be found. This problem has an efficient algorithm whose complexity is 
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0(n 2 logn + n\E\) HJoh771 . For R = C(\ogn/n) l l d with constant C, the graph is sparse with \E\ = 
O(nlogn), whence the complexity is 0(n 2 log n). Contrary to MDS-MAP, in HOP-TERRAIN we 
must only compute the shortest paths between the unknown nodes and the anchors. This distributed 
shortest paths algorithm can be done efficiently with total complexity of 0(nm). 

V. Main results 

In this section we present our main results regarding the performance of MDS-MAP and HOP- 
TERRAIN algorithms. 

A. MDS-MAP 

Our first result establishes an upper bound on the error achieved by MDS-MAP when we have only 
the connectivity information as in the case of the connectivity-based model. 

Let X denote annxrf estimation for X with an estimated position for node i in the ith row. Then, 
we need to define a metric for the distance between the original position matrix X and the estimation 
X, which is invariant under rigid transformation of X or X. 

Define L = I n — (l/n)l n l^ as in the MDS algorithm. L is an n x n rank n — 1 symmetric matrix, 
which eliminates the contributions of the translation, in the sense that LX = L(X + ts T ) for all s £ R d . 
Note that L has the following nice properties: 

1) LXX T L is invariant under rigid transformation. 

2) LXX T L = LXX T L implies that X and X are equal up to a rigid transformation. 
This naturally defines the following distance between X and X. 

d m JX,X) = — 1 1 LXX T L — LXX T L 1 1 „ , (4) 

where = • A 2 -) 1 ^ 2 denotes the Frobenius norm. Notice that the factor (1/n) corresponds to 

the usual normalization by the number of entries in the summation. Indeed this distance is invariant to 
rigid transformation of X and X. Furthermore, d- mv (X, X) = implies that X and X are equal up to a 
rigid transformation. With this metric, our main result establishes an upper bound on the resulting error. 



The proof of this theorem is provided in Section VI We define 



#mds = 32 -. * . (5) 



a{n - 2) 

Theorem 5.1 (connectivity-based model): Assume n nodes are distributed uniformly at random in the 
[0, l] d hypercube, for a bounded dimension d G {2,3}. For a positive radio range R and detection 
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probability p defined in ([T]), we are given the connectivity information of the nodes according to the 
range-free model with probabilistic detection. Then, with a probability larger than 1 — 1/n 4 , the distance 
between the estimate X produced by MDS-MAP and the correct position matrix X is bounded by 

d inv (X,X)< +20R, (6) 

for R > (l/a) 1 / d i?MDS, where di nv (0 is defined in Q and -Rmds in ([5]). 

The proof is provided in Section VI The following corollary trivially follows, as for each E E, we 
have dij < R. 



Corollary 5.2 (range-based model): Under the hypotheses of Theorem 5.1 and in the case of rang- 
based model, with high probability 

dinAX,X) < R ^ +20R. 
R 

As described in the previous section, we are interested in the regime where R = C(logn/n) 1//ci for 
some constant C. Given a small positive constant 5, this implies that MDS-MAP is guaranteed to produce 
estimated positions that satisfy di nv (X, X) < 5 with a large enough constant C and a large enough n. 

When a is fixed and R = C(logn/n) 1 ' d for some positive parameter C, the error bound in (|6]) 
becomes 

for some numerical constants C\ and C<i- The first term is inversely proportional to C and a 1//rf and is 
independent of n, whereas the second term is linearly dependent on C and vanishes as n grows large. This 
is illustrated in Figure [6j which shows numerical simulations with n sensors randomly distributed in the 
2-dimensional unit square. Notice that the resulting error is inversely proportional to a and independent 
of p. 

Remark 5.3: Even though the upper bounds for both range-free and range-based models have the same 
form, there is a slight difference between their behaviours as R grows. In the range-free case, up to some 
point, the performance of MDS-MAP improves as R increases. This is due to the fact that the first 
and second terms go in opposite directions as a function of R. However, In the range-based case, as R 
increases, we obtain a more accurate estimate of the the Euclidean distance. As a result, once the radio 
range increases, the resulting error of MDS-MAP decreases and we do not see the contribution of the 
second term. This phenomenon is illustrated in Figure [7] 

Using the above theorem, we can further show that there is a linear transformation S G R dxci , such 
that when applied to the estimations, we get a similar bound in the Frobenius norm of the error in the 
positions. 
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Fig. 6. Average distance between the correct topology X and the estimation X using MDS-MAP as a function of C where 
the radio range is R = CVlogn/n. The n = 1, 000 sensors are distributed randomly on a unit square under range-free model. 
Various values of a and /3 are used where two nodes at distance r are detected with probability p(r) — min{l, a(Rfr) 13 }. 
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Fig. 7. Average error of MDS-MAP under the range-based model. 



Theorem 5.4: Under the hypotheses of Theorem 5.1 with high probability 

min -±=\\LX-LXS\\ < v'e f — + 20R 



Remark 5.5: Note that although for the sake of simplicity, we focus on [0, l] d hypercube; our analysis 
easily generalizes to any bounded convex set and homogeneous Poisson process model with density 
p = n. The homogeneous Poisson process model is characterized by the probability that there are exactly 
k nodes appearing in any region with volume A : F(k A = k) = { - £ §^e-P A . Here, k A is a random variable 
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defined as the number of nodes in a region of volume A. 

Remark 5.6: To simplify calculations, we assumed that d is either 2 or 3. However, the analysis easily 
applies to general d and only the constant in the bound §6§ would change as long as d = 0(1). 

In what follows we investigate an important question whether similar performance guarantees, as in 
MDS-MAP, can be obtained in a distributed setting. In particular, we analyze the performance of the 
HOP-TERRAIN algorithm. As we have already stressed, this algorithm can be seen as a distributed 
version of the MDS-MAP algorithm. In particular, we show that when only connectivity information is 
available, for every unknown node. The Euclidean distance between the estimate and the correct position 



can be bounded very similarly to Theorem 5.1 



B. HOP-TERRAIN 

Our second result establishes that HOP-TERRAIN [SLR02] achieves an arbitrarily small error for a 
radio range R = C(\ogn/n) l ' d with a large enough constant C, when we have only the connectivity 
information as in the case of the connectivity-based model. The same bound holds immediately for 
the range-based model, when we have an approximate measurements for the distances, and the same 
algorithm can be applied without any modification, to compute better estimates for the actual distances 
between the unknown nodes and the anchors, the extra information can be readily incorporated into the 
algorithm. We define 

« HOPS12 (i^y. (7) 



a(n-2) 

Theorem 5.7: Assume n sensors and m anchors are distributed uniformly at random in the [0, l] d 
hypercube for a bounded dimension d G {2,3}. For a given radio range hop, detection 

probability p defined in ([T]), and the number of anchors m = O(logn), the following is true with 
probability at least 1 — 1/n 4 . For all unknown nodes i G V u , the Euclidean distance between the estimate 
Xi given by HOP-TERRAIN and the correct position Xj is bounded by 

hi-XiW < ^ +24R. (8) 



The proof is provided in Section VI As described in the previous section, we are interested in the 
regime where R = C (log n/n) 1 ^ for some constant C. Given a small positive constant 5, this implies 
that HOP-TERRAIN is guaranteed to produce estimated positions that satisfy \\xi — Xi\\ < 5 for all 
i G V u with a large enough constant a and large enough n. 
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Fig. 8. Three anchors in fixed positions ([0, 0], [1, 0], [0, 1]) for a two-dimensional sensor localization. 

When the number of anchors is bounded and the positions of the anchors are chosen randomly, it 
is possible that, in the triangulation step, we get an ill-conditioned matrix A T A, resulting in an large 
estimation error. This happens, for instance, if three anchors fall close to a line. However, as mentioned 
in the introduction, it is reasonable to assume that, for the anchors, the system designer has some control 
over where they are placed. In that case, the next remark shows that when the positions of anchors are 
properly chosen, only d + 1 anchors suffice to get a similar bound on the performance. Note that this is 
the minimum number of anchors necessary for triangulation. For simplicity we assume that one anchor 
is placed at the origin and d anchors are placed at positions corresponding to d-dimensional unit vectors. 



The position of the d + 1 anchors are {[0, . . . , 0], [1,0,..., 0], [0, 1, 0, . . . , 0], [0, . . . , 0, 1] }. (see figure 



Theorem 5.8: Assume that n sensors are distributed uniformly at random in the [0, l] d hypercube for 
a bounded dimension d = {2, 3}. Also, assume that there are d + 1 anchors, one of which is placed at 
the origin, and the position vectors of the d remaining anchors are the d-dimensional unit vectors. For 



with probability at least 1 — 1/n 4 . For all unknown nodes i G V u , the Euclidean distance between the 
estimate X{ given by HOP-TERRAIN and the correct position Xi is bounded by 



8) 



a given radio range R > (1 /a) 



/ d i?HOP and detection probability p defined in ([I]) the following is true 



1 1 2 



R 



+ 48R . 



(9) 



The proof is provided in Section VI 
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Fig. 9. Average distance between the correct position {xt} and estimation using HOP-TERRAIN as a function of C, for 
R = Cy/\ogn/n with n = 5, 000 sensors in the unit square under connectivity-based model. Two nodes at distance r detect 
each other with probability p(r) = min{l, a(R/r)^}. 
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Fig. 10. Average error under range-based model. 



Remark 5.9: There is nothing particular about the position of the anchors in unit vectors. Any d + 1 
anchors in general position will give similar bounds. The only difference is that the constant term in the 
definition of Rao? changes with the anchor positions. 



Corollary 5.10 (range-based model): Under the hypothesis of Theorem 5.7 and in the range-based 
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model, with high probability 



\x i -x i \\< Bi ^ P +2AR. 

R 



The similar result holds true when sensors are places deterministically, specifically, under the hypothesis 



of Theorem 5.8 with high probability, 



\\xi - Xi\\ < 2?^- +48i? . 
R 

As it was the case for MDS-MAP, when R = C(\ogn/n) 1 l d for some positive parameter C, the error 
bound in ([9]) is 

for some numerical constants C\ and C<i- The first term is inversely proportional to C and a 1//rf and is 
independent of n, whereas the second term is linearly dependent in C and vanishes as n grows large. This 
is illustrated in Figure [9] which shows numerical simulations with n = 5, 000 sensors randomly distributed 
in the 2-dimensional unit square. We compute the root mean squared error: {(1/n) Y^t=i \\ x i ~ x i\\ 2 } 1 ^ 2 ■ 



Figure 1 1 shows a network consisting of n = 200 nodes place randomly in the unit circle. The three 
anchors in fixed positions are displayed by solid blue circles. In this experiment the distance measurements 

shows the final estimated 



12 



are from the range-based model and the radio range is y/0.8 log n/n. Figure 
positions using HOP-TERRAIN. The circles represent the correct positions, and the solid lines represent 
the differences between the estimates and the correct positions. The average error in this example is 0.075. 




Fig. 11. 200 nodes randomly placed in the unit square and 3 anchors in fixed positions. The radio range is R — ^/0.8 * log n/n. 
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Fig. 12. Location estimation using HOP-TERRAIN. 



VI. Proof of the Main Theorems 



A. Proof of Theorem 5.1 



We start by bounding the distance d- mv (X, X), as denned in Eq. (|4j), in terms of D and D. Let 
\\A\\f = (Y^i j Ajj) 1 / 2 denote the Frobenius norm of a matrix and ||A||2 = maxy-E^i H^lb denote the 
spectral norm. Note that for a rank r matrix A we have 

\\A\\ 2 < \\A\\ F < r\\A\\ 2 . 

Since L(XX T - XX T )L has rank at most 2d, it follows that 

\\L(XX T - XX T )L\\ F < V2d\\L(XX T - XX T )L\\ 2 . (10) 

To bound the spectral norm, let M = —(1/2)LDL. Then, 

\\L(XX T - XX T )L\\ 2 < \\LXX T L- M\\ 2 + \\M - XX T \\ 2 

< (l/2)\\L{-D + D)L\\ 2 + (l/2)||L(-5 + D)L\\ 2 

< \0-D\\ 2 , (11) 

where in the first inequality we used the triangular inequality and the fact that X = LX. In the second 
inequality we used (|2]) and the fact that 

\\M - XX T \\ 2 = min \\M - A\\ 2 , 

A:rank(A)<d 
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which follows from the definition of X. From the definition of X = MDSd(-D), we know that XX T is 
the best rank-d approximation to M. Hence, XX T minimizes \\M — A\\2 for any rank-d matrix A. Since 
the rank of — (1/2)LDL is d, this implies 

||M - XX T \\ 2 < \\M + (1/2)LDL\\ 2 . 



The inequality in (111 follows trivially from the observation that \\L\\2 = 1. 

Next, to bound \\D — D\\2, we use the following key result on the number of hops in graph G. The 
main idea is that, for sensors with uniformly random positions, the number of hops scaled by the radio 
range R provide estimates close to the correct distance. We define 

~ / 12 log n \ 1 

fi = 2 % . (12) 

V aln -2) J 



Lemma 6.1: (Bound on the distance estimation) Under the hypotheses of Theorem |5.1| with probability 
larger than 1 — 1/n 4 , for any pair of nodes i G V and j G V, the number of hops between nodes i and 
j is bounded by 



for R > m&x{7R, {l/a) l l d R}. 



The proof of this lemma is provided in Section VI-G The distance estimate from the first step of MDS 



MAP is dij = Rhij. The following corollary gives a bound on the estimation error. 



Corollary 6.2: Under the hypotheses of Lemma 6.1 



Proof: From Lemma 6.1 we know that 
The corollary follows from the assumption that 7R < R < 1 and d < 3. 



Define an error matrix Z = D — D. Then by Corollary 6.2 Z is element-wise bounded by 



< Zij < (30R/(14R))Dij + 8R. 

We can bound the spectral norm of Z as follows. Let u and v be the left and right singular vectors of the 
non-negative matrix Z, respectively. Then by Perron-Frobenius theorem, u and v are also non-negative. 



October 14, 2011 



DRAFT 



28 



It follows that 

\\D-D\\ 2 = u T Zv 

< (30R/(UR))u T Dv + (l T u)(t T v)8R 

< (30R/(UR))\\D\\ 2 + 8Rn 

< (30R/(UR))dn + 8Rn . (13) 

The first inequality follows from the element-wise bound on Z and the non-negativity of u and v, and the 
second inequality follows form the definition of the spectral norm and the Cauchy-Schwarz inequality. In 
the last inequality, we used \\D\\2 < dn, which follows from the fact that D is non-negative and element- 
wise bounded by d. Typically we are interested in the regime where R = o(l), and by assumption we 



know that R> R and d < 3. Therefore, the first term in ( 13 1 dominates the error. Substituting this bound 



on \\D — D\\2 in (111 proves the theorem. 



B. Proof of Theorem 5.4 



Using SVD we can write LX as U nxd Z dxd Vj xd where U T U = I dxd , V T V = VV T = I dxd and S 
is a diagonal matrix. We also denote the Frobenius inner product between to matrices A mxn and B mxn 
by 

(A,B) = ^A itJ B^. 

It is easy to show that 

(A, B) = Tr(A T B) < \\A\\ F \\B\\ F . 

In fact, this inner product induces the Frobenius norm definition. In particular, for an m x n matrix A 
we have 

||^4||f = sup (B,A). 

BeM. mxn ,\\B\\ F <l 

Now, for S = X T LUT I - 1 V T , we have 



\LX-LXS\\ F = sup (B,LX-LXS) 

l nxd ,\\B\\ F <l 



sup {B,{LXVY.U T - LXXT^U^V 7 ) 

" xd ,||B|| F <l 

sup {BVY I - 1 U T , LXX T L - LXX T L) 

nxd ,\\B\\ F <l 

sup \\BVE~ 1 U T \\ F \\LXX T L - LXX T L\\ F . 

">< d ,||B|! F <l 
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Using the fact \\A\\ F = Tr(A T A) and the cyclic property of the trace, i.e., Tv(ABC) = Tr(BCA), we 
obtain 

\\BVTr x U T \\ F = Tt(BVS" 2 V T B) < a^jBfp, 

where er m i n is the smallest singular value of LX. It remains to show that cr m i n > y/n/6 holds with 
high probability when nodes are placed uniformly at random. To this end we need to consider two facts. 
First, the singular values (and in particular the smallest singular value) are Lipschitz functions of the 
entries (See appendix). Second, we have E(LXXL) = (n/12)Idxd- By using concentration of measure 
for Lipschitz functions on bounded independent random variables, the result follows. 



C. Proof of Theorem 5. 7 



In this section we provide the proofs of the theorems 5.7 Detailed proofs of the technical lemmas are 
provided in the following sections. 

For an unknown node i, the estimation x\ is given in Eq. ([3]). 



< \\(A T A)- 1 A T \\ 2 \\bS ) -b^\\, 



(14) 



First, to bound ||6q — b®\\, we use Corollary 



6.2 



Since dfj < d for all i and j, we have 



|6«-6W| 



771—1 



72 \2 



30R 



1/2 



fc=l 



< 2Vm- 1 



UR 



d + 8R 



(15) 



Next, to bound \\(A T A)~ 1 A T \\ 2 , we use the following lemma. 

Lemma 6.3: Under the hypothesis of Theorem |5.7| the following is true. Assuming random anchor 
model in which m = fi(logn) anchors are chosen uniformly at random among n sensors. Then we have 

\\(A T Ar 1 A T \\ 2 < 



m — 1 



with high probability. 



By assumption we know that R > R and d < 3. By combining ( |14| ), ( [T5j ) and Lemma 
Theorems 15.71 



6.3 



proves 
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D. Proof of Theorem \5.8\ 



In this section we provide the proof of Theorem 5.8 Detailed proofs of the technical lemmas are 
provided in the following sections. 



Similarly to the proof of Theorem 5.7 for an unknown node i, and the estimate Xi we have 

\\xi- xi\\ < IKii^r^Hall^-ftWll, 

We have already bounded the expression ||6q' ) — &W|| in (J_5). To bound || (^4 T ^4) _1 A T ||2, we use the 
following lemma. 



Lemma 6.4: Under the hypothesis of Theorem |5.8| the following are true. We assume a deterministic 
anchor model, where m = d + 1 anchors are placed on the positions 

xi = [1,0,. ..,0], 
x 2 = [0,1,0,...,0], 

x d = [0,0,..., 0,1], 
x d+1 = [0,0,..., 0]. 

Then, 

\\{A T A)-'A T \\ 2 < d -, 



with high probability. This finishes the proof of Theorems 5.8 



E. Proof of Lemmas 6.3 (Random Model) 

As it was the case in the proof of Lemma 6.4 in order to upper bound [[(^"A) -1 ^^ we need to lower 



bound the smallest singular value of A. Let the symmetric matrix B be defined as A 7 A. The diagonal 
entries of B can be written as 

m— 1 

hi = 4^2(x k}i -x k+lji ) 2 , (16) 



k=l 

for 1 < i < d and the off-diagonal entries as 

m— 1 

bi J = 4 ^2 ( X k,i ~ Xk+l,i)(x k j ~ Xk+lj), (17) 
k=l 

for 1 < i / j < d where x k ^ is the i-th element of vector x k . In the following lemmas, we show 
that with high probability, as m increases, the diagonal entries of B will all be of the order of m, i.e., 
h,i = @(m), and the off-diagonal entries will be bounded from above by m^ +t , i.e., bij = o(m). 
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Lemma 6.5: For any e > the diagonal entries of B are bounded as follows. 



(\b iti -2(m- l)/3 1 >4mHj 



< 4e 



-rrv° 



The idea is to use Hoeffding's Inequality (see appendix [A]) for the sum of independent and bounded 
random variables. To this end, we need to divide the sum in ([16]) into sums of even and odd terms as 
follows: 

h. . — w 4- 

where 

K = 4 ( x k,i~ x k+i,if, (18) 

k Seven 

bl = (19) 



fcSodd 



This separation ensures that the random variables in summations ( 1 8 1 and (jT9j) are independent. Let the 



random variable z\ denote the term A{xu,i — Xk+i,i) 2 in ( fT8] l. Since z\ S [0, 4] and all the terms in b\ 
are independent of each other, we can use Hoeffding's Inequality to upper bound the probability of the 
deviation of b\ from its expected value: 



1 6* - (m - 1)/3| > 2m5 +e J < 2e~ m , (20) 
for any fixed e > 0. The same bound holds for b . Namely, 

P (\bi - (m - 1) /3| > 2m» + j < 2 e - m2s . (21) 

Hence, 

P (\b iti - 2(m - l)/3j > 4m5 +e j 

(|6 e - (m - 1)/3| + |6 - (rn - 1)/3| > 4to5+ £ ) 



(a) 
< 



(6) 

< 4e" m , 

where in (a) we used triangular inequality and in (b) we used the union bound. 
Lemma 6.6: For any e > the off-diagonal entries of B are bounded as follows. 

bijl > 16ni2 +t ) < 4e" 



The proof follows in the same lines as the proof of Lemma 6.5 
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Using the Gershgorin circle theorem (see appendix [A]) we can find a lower bound on the minimum 
eigenvalue of B. 

A nri n(fl)>niin(& M -.R i ), (22) 



where 



Now, let Bjj denote the event that {b^i < 2(m — l)/3 — 4m^ +e } and B^- (for i ^ j) denote the event 
that {bij > 16m2 +<E }. Since the matrix B is symmetric, we have only d(d + l)/2 degrees of freedom. 



Lemma |6.5| and 6.6 provide us with a bound on the probability of each event. Therefore, by using the 



union bound we get 

Bij J < 1 — 2^ L v L './ 

= l-3d 2 e- m2 

Therefore with probability at least 1 — 3d 2 e~ m2 ' we have 

2(m-l) 



ba-Ri>-^-z --16d-rri2 +t , (23) 



for all 1 < i < d. As m grows, the RHS of (23) can be lower bounded by (m — l)/3. By combining 



( 22 1 and ( 23 1 we can conclude that 



XMB) > ) > 1 - 3d 2 e" m2t . (24) 



As a result, from (26 1 and (24 1 we have 



P ( \\(A r A)- l A\\ 2 < \/^rrj > l-3rf 2 e- m ", (25) 
which shows that as m grows, with high probability we have IK^A) -1 ^^ < 



m— 1 ' 



F. Proof of Lemmas 6.4 (Deterministic Model) 

By using the singular value decomposition of a tall m — lxd matrix A, we know that it can be written 
as A = UT,V T where U is an orthogonal matrix, V is a unitary matrix and E is a diagonal matrix. Then, 

(A T A)~ 1 A = UY>- l V T . 

Hence, 

\\{A T A)- 1 A\\ 2 = -\- (26) 
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where a m i n (A) is the smallest singular value of A. This means that in order to upper bound || (y4 T ^4) _1 ^4||2 
we need to lower bound the smallest singular value of A. 

By putting the sensors in the mentioned positions the d X d matrix A will be Toeplitz and have the 
following form. 



A = 2 



1 -1 
1-1 













1 -1 
1 



We can easily find the inverse of matrix A. 



A' 1 



1 



1 1 1 
1 1 



••• 11 

••• 1 

Note that the maximum singular value of A' 1 and the minimum singular value of A are related as 
follows. 



&min\-A) 



1 



(27) 



&max(A t 

To find the maximum singular value of A^ 1 , we need to calculate the maximum eigenvalue of A -1 (^4 _1 ) T 
which has the following form 

d d- 1 d-2 ■■■ 1 
d-l d-l d-2 ■■■ 1 

a- 1 (A-r = \ 

2 2 2 1 

1 111 
By using the Gershgorin circle theorem (see appendix [A]) we can find an upper bound on the maximum 
eigenvalue of A^ 1 (v4 _1 ) T . 



Hence, by combining ( 26 1 and ( 28 1 we get 



\U T A)^A\\ 2 < d -. 



(28) 



(29) 
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G. Proof of the Bound on the Number of Hops 

We start by applying a bin-covering technique in a similar way as in [MP05], [QK M10I1 . HKO10L 
In this section, for simplicity, we assume that the nodes are placed in a 3-dimensional space. However, 
analogous argument proves that the same statement is true for d = 2 as well. 

For each ordered pair of nodes such that dy > R, define a 'bin' as 

Ay = {x G [0, l] 3 | R — 5 < d(x, Xi) < R, <L{xj — Xi,x — Xi) < #} , 

where 5 and 9 are positive parameters to be specified later in this section, and •) : M. d x M. d — > [0, n] 
is the angle between two vectors: 

•) = arccos(2:fz2/(||zi||||z2||))- 

We say a bin Ay is occupied if there is a node inside the bin that is detected by node i (i.e., conencted 
to node % in the graph G). Next, for each unordered pair of nodes such that dij < R, define a bin 

as 

B id = {x G [0, l] 3 | d(x, Xi) < R, d{x, xj) < R} . 

We say a bin Bij is occupied if there is a node inside the bin that is simultaneously detected by nodes 
i and j (i.e., connected to both nodes i and j in the graph G). When n nodes are deployed in [0, l] d 
uniformly at random, we want to ensure that, with high probability, all bins are occupied for appropriate 
choices of R, 5, and 9. 
First when dij > R, 

IP (-Ay is occupied) = 1 — (1 — P(node I occupies Ay)) 

> 1-fl-^J J 2vrr 2 sm((p)p(r)drd(p 

= l-(l- \a{l - CO s{9))R P ^{R^ ~(R- 5) 3 " /3 ) 

for ft G [0,3) as per our assumption. Since Ay's are constrained to be in [0, l] 3 , we need to scale the 
probability by 1/4. The above inequality is tight in the worst case, that is when both nodes i and j lie 
on one of the edges of the cube [0, l] 3 . We choose 9 such that 1 — cos(#) = (5/R) 2 . Then using the 
facts that l-z< exp(-z) and (1 - z 3 ^) < (3 - /3)(1 - z)/3 for z G [0, 1) and /3 G [0, 3), we have 

/ TTCtS^ \ 

P(Ay is occupied) > 1 — exp I (n - 2) ) , (30) 
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which is larger than 1 — 1/ra 6 if we set 5 = (12 log nj(a(n — 2))) 1 / 3 . 

Next we consider the case when nodes i and j are at most R apart. Notice that nodes i and j may 
not be directly connected in the graph G, even if they are within a radio range R. The probability that 
they are not directly connected is 1 — a(dijfR)^ 13 , which does not vanish even for large n. However, 
we can show that nodes i and j are at most 2 hops apart with overwhelming probability. The event that 
hi j > 2 is equivalent to the event that By is occupied. Then, 

IP (-By is occupied) = 1 — J^J (1 — P(node I is detected by i and j)) 

> i _ (i _ V (B itj )a 2 ) n - 2 

> l-exp{-V(B itJ )a 2 (n-2)} , (31) 

where V(By) £ M is the volume of Bij, and we used the fact that the probability of detection is lower 
bounded by a. V(Bij) is the smallest when nodes % and j are distance R apart and lie on one of the 
edges of the cube [0, l] 3 . In a 3-dimensional space, 

V(Bij) > (l/4)(5/12)^R 3 > {l/A)R\ 



Substituting these bounds in (31), we get 

P(B id is occupied) > 1 - exp {-(l/A)a 2 R 3 (n - 2)} , (32) 

which is larger than 1 - 1/ra 6 for R > ((241ogn)/((n - 2)a 2 ))^ 3 . 

For each ordered pair (i, j), we are interested in the bin Aij if dy > R and Bij if dij < R. Using 



the bounds in ( |30| ) and ( |32| ) and applying union bound on all n(n — 1) ordered pairs of nodes, all bins 

{A^ | d id > R} U {Bij | dij < R} 

are occupied with a probability larger than 1 — 1/n 4 . 

Now assuming all bins are occupied, we first show that the number of hops between two nodes i and j 
is bounded by a function F(dij) that only depends on the distance between the two nodes. The function 

F : R + —I- R + is defined as 

f 2 if z < R , 

F(z) = 

[ k + 2 if z e C k for k <G {1,2,...} , 

where Ck denotes the interval (k(R — V35) + V%5, k(R — \/35) + R]. Our strategy is to use induction 
to show that for all pairs of nodes, 

hij < F{dij) . (33) 
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First, assume nodes i and j are at most R apart. Then, by the assumption that the bin By is occupied 
there is a node connected to both i and j. Therefore the number of hops /iy is at most 2. 



Next, assume that the bound in ( |33| ) is true for all pairs (I, m) with 

di, m < V3S + k(R-V35). 

For two nodes i and j at distance dy G consider a line segment £y in the 3-dimensional space with 
one end at X{ and the other at Xj. Let y G M 3 be the point in the line segment £y that is at distance 
i? from Xj. We want to show that there exists a node that is close to y and is connected to node i. By 
definition, y is inside the bin Ay. We know that the bin Ay is occupied by at least one node that is 
connected to node i. Let us denote one of these nodes by I. Then d(y,xi) < ^/35 because 

sup d(z, y) = ^5 2 + 2R(R - 5){1 - cos(0)) < VS5. 

We use the following triangular inequality which follows from the definition of the number of hops. 

hij < hij + hij . 

Since / is connected to i we have hij = 1. By triangular inequality, we also have dij < d(y, xj)+d(y, xi). 
It follows from d(y,Xj) = dij — R and d(y,xi) < \/35 that 

dij < dij — R + V35. 

Recall that we assumed di.j < R + k(R — \/3<5). Since we assumed that (33 1 holds for di : j < V35 + 
we have 

h"i j ^ k ~~\~ 2 j 

for all nodes i and j such that dy < i? + A;(i? — \/3^)- By induction, this proves that the bound in ( [33] ) 
holds for all pairs (i, j). 

We can upper bound F(z) with a simple affine function: 

F(z) < 2 + ^— z 

R-V36 

( 25\ z 

where the last inequality is true for R > 2y3<5/(2 — Together with (33 1 this finishes the proof of 
the lemma. 

Figure 13 illustrates the comparison of the upper bounds F(dij) and F a (dij), and the trivial lower 
bound dij > dij in a simulation with parameters d = 2, n = 6000 and R = ^/64 log n/n. The 
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simulation data 






affine upper bound 






upper bound — ~ 






lower bound 

















0.5 1 1.5 



Fig. 13. Comparison of upper and lower bound of shortest paths {dij} with respect to the correct distance {dij} computed 
for n = 6000 sensors in 2-dimensional square [0, l] 2 under connectivity-based model. 



simulation data shows the distribution of shortest paths between all pairs of nodes with respect to the 
actual pairwise distances, which confirms that the shortest paths lie between the analytical upper and 
lower bounds. Although the gap between the upper and lower bound is seemingly large, in the regime 
where R = C \f\og n/n with a constant C, the vertical gap R vanishes as n goes to infinity and the slope 
of the affine upper bound can be made arbitrarily small by increasing the radio range R or equivalently 
taking large enough C. 

VII. Conclusion 

In many applications of wireless sensor networks, it is crucial to determine the location of nodes. For 
this matter, numerous algorithms have been recently proposed where the efficiency and success of them 
have been mostly demonstrated by simulations. In this paper, we have investigated the centralized and 
distributed sensor localization problem from a theoretical point of view and have provided analytical 
bounds on the performance of such algorithms. More precisely, we analysed the MDS-MAP and HOP- 
TERRAIN algorithms and showed that even when only the connectivity information was given and in 
the presence of detection failure, the resulting error of both algorithms is bounded and decays at a rate 
inversely proportional to the detection range. 
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Appendix 



Hoeffding's inequality MHoe63l is a result in probability theory that gives an upper bound on the 
probability for the sum of random variables to deviate from its expected value. Let z\ , Z2, ■ ■ ■ , z n be 
independent and bounded random variables such that G [afc,6fc] with probability one. Let s n = 
J2k=i z k- Then for any 5 > 0, we have 



The Gershgorin circle theorem [HJ85 ] identifies a region in the complex plane that contains all the 
eigenvalues of a complex square matrix. For an n x n matrix A, define 



3+i 

Then each eigenvalue of A is in at least one of the disks 

{z:\z- a,i t i\ < Ri}. 

Informally, concentration of Lipschitz functions says that any smooth function of bounded indepen- 
dent random variables is tightly concentrated around its expectation ULedOli The notion of smoothness 
we will use is Lipschitz. 

Definition A.l: f : IR n — > R is A-Lipschitz with respect to the l p norm, if for all x and y, 



It turns out that Hoeffding's bound holds for all Lipschitz (with respect to l\ norm) functions. More 
precisely, suppose X\, X2, ■ ■ ■ , X n are independent and bounded with aj < xi < b^. Then for any 
/ : W 1 — > R which is A-Lipschitz with respect to the l\ norm, 





\f(x)-f(g)\<X\\x-y 
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